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ABSTRACT 

This  paper  reviews  current  mathematical  and 
computer  methods  for  the  analysis  of  group  and 
organizational  structures.  Sociometric  data  collection 
methods,  the  detection  of  cliques  and  subgroups,  sociometric 
index  construction,  and  the  use  of  computers  in  sociometry 
are  discussed.  Clique  detection  methods  are  classified  as 
either  linkage  methods,  in  which  sociometric  data  are 
treated  as  linear  graphs,  or  as  distance  methods,  in  which 
data  are  treated  as  a  configuration  of  points  in  a  space. 
Several  linkage  and  distance  analysis  methods  are  discussed 
and  .compared.  It  is  suggested  that  some  of  the  methods  of 
numerical  taxonomy  could  be  applied  to  the  analysis  of  group 
and  organizational  structures. 
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Over  the  last  thirty-eight  years  sociometry  has 
become  one  of  the  standard  procedures  for  the  analysis  oi 
group  structures  of  communication,  influence,  ana 
affiliation.  Substantive  sociometric  work  is  beincr  don*^  in 
settings  ranging  from  industrial  plants  and  nospitals  to 
nursery  schools.  In  addition  to  investigators  wno  use 
sociometric  techniques  in  their  research  on  groups  ana 
organizations,  there  is  a  steadily  increasing  number  or 
workers  who  are  primarily  interested  in  the  invention  ana 
refineinenu  or  techniques  for  the  measurement  of  group 
structures. 

This  report  surveys  sociometric  methodology 
ratner  tnan  sociometric  investigation.  Research  in 
substantive  areas  nas  been  cited  to  illustrate 
:netnodolo.}ical  points  ratner  than  to  demonstrate  v/aau 
sociometric  research  nas  revealed  about  groups  ano 
orgcini  zations .  Lindzey  ana  ijyrne  (1966)  proviae  tne  best 
overall  review  or  sociometric  research,  wnile  Glanzer  and 
Glaser  (1959)  review  mathematical  tecnnigues  of  sociometric 
analysis  in  detail.  Guimaraes  (19o8)  provioes  an  excellent 
introuuction  to  the  mathematical  concepts  of  sociometric 
anciiysis  and  review  of  matrix  multiplication  in  sociometry. 

This  paper  benins  by  surveying  present  definitions 
of  sociometry  and  by  suggesting  some  extensions  to  these 
definitions.  Its  major  sections  are  organized  arouno  tv/o 
main  tasks  of  the  sociometric  investigators  collecting 
data;  viewing  the  group  as  composed  of  many,  possioly 
overlapping  subgroups,  and  computing  individual  and  group 
indices.  The  section  on  data  collection  considers 
"lues t ionnaires  and  •  rating  instruments,  but  not  direct 
observational  techniques.  Limited-  versus  unlimited-choice 
instruments  are  comoared,  ^no  ranking  techniques  are 
compared  v/ith.- airect  choice  and  rating  schemes. 

The  section  on  the  definition  of  sociometric 
subgroups  is  divided  into  two  major  parts.  The  first  deals 
witli  what  can  be  called  linkage  analysis  techniques,  in 
which  group  members  are  consioered  as  points  linked  by 
lines.  The  second  deals  witn  what  can  be  called  distance 
analysis  tecnniques,  in  v/hich  group  members  are  considerea 
as  ooints  distributed  through  a  geometric  space.  The  paper 
concluues  with  a  sf^ort  section  on  the  construction  or 
sociometric  inoices  ana  a  review  of  computing  techniques  in 
socio.netry . 

ri hy  soc  iometrv.   Sociometry  is-a  tecnnique  deeply 
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grounded  in  common  experience.  People  are  constantly 
evaluating  and  being  evaluated  by  their  acquaintances. 
People  and  their  acquaintances  are  bound  together  by 
patterns  of  friendship  and  association.  Groups  are 
•oerceived  as  having  centers,  peripheries,  ana  boundaries; 
individuals  as  isolates — people  marginal  to  our  own  or  other 
groups,  or  as  central  to  a  pattern  of  association.  Most  or 
the  metaphors  used  to  describe  networks  of  associations  have 
spatial  or  geometric  implications,  Moreno  has  often 
commented  on  the  sociogram's  immediate  impact  on  members  of 
the  group  it  represents;  evidently  a  representation  of  a 
group  in  a  two-dimensional  Euclidian  space  speaks  very 
strongly  to  people's  feelings  about  themselves  and  their 
associates . 

If  we  are  so  taken  by  these  immediate  geometric 
and  spatial  perceptions  of  social  structure,  then  our 
notivation  for  investigating  the  spatial  and  geometric 
aspects  of  group  structures  in  a  more  rigorous  form  is  easy 
to  explain.  Although  the  world  is  experienced  as  a 
three-dimensional  Euclidian  space,  there  is  no  reason  why 
the  structure  of  acquaintance  patterns  should  be  restricten 
to  this  form. 

The  rubric  of  sociometry  now  covers  a  broad  range 
of  investigative  techniques  and  a  multitude  of  sins.  The 
reviewer's  task  is  ■  not  so  much  to  provioe  a  single 
derinition  of  the  technique  as  to  assemble  a  set  of 
definitions  which  characterize  the  tasks  of  sociometry  anc 
vhich  will  cover  an  ever-broadening  set  or  techniques. 

Definitions  on  the  .  'asis  of  dota  content.  The 
first  use  of  sociometry  was  by  J,  L,  Moreno  in  the  1920s  ana 
was  popularized  in  his  book,  nho  Shal 1  Survive?  (1934), 
Moreno's  criteria  ror  a  sociometric  instrument  include:  tnat 
it  be  given  to  a  limited  group  of  respondents,  each  of  whom 
is  asked  to  indicate  his  preference  for  engaging  in  an 
activity  or  activities  with  other  members  of  the  group;  that 
there  be  an  unlimited  number  of  choices  and  rejections 
allowea  within  the  group;  that  the  choices  be  made 
orivately  and  anonymously;  and  that  the  questions  fit  tne 
jrouD'^s  level  of  understanding.  These  criteria  still 
describe  most  of  the  sociometric  research  done  today, 

Moreno  also  suggested  that  valid  results  woula  not 
be  obtained  unless  the  group  were  aware  that  some 
consequence  would  result  from  the  sociometric  study, 
liowever,  Mouton,  Blake,  and  Fruchter  (1960a,  1960b)  in  their 
reviews  or  the  reliability  and  validity  of  sociometric 
instrumentation,  observe  that  it  is  not  possible  to  obtain 
an  equivalent  retest  in  a  group  which  has  been  restructured 
as  a  result  of  a  previous  sociometric  study. 

Most  attempts  to  define  the  sociometric  methoo 
nave      concentrated   on   the   objects   of  measurement  ana  the 
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content  of  the  measuring  instruments.  Bain  (1943),  in  cin 
attempt  to  synthesize  the  concepts  of  sociornetry,  concluaeo 
that  sociornetry  "will  remain  a  generic  term  to  describe  a  11 
neasurement  of  societal  and  interpersonal  ddta,"  .lore 
recently  Bjerstedt  (19b6,  op.  15-23)  identified  thirteen 
definitions  of  sociornetry  which  he  submitted  to  a  panel  or 
269  experts  for  rating.  The  modal  response  among  the  131 
returns  was  tnat  sociornetry  is  the  quantitative  treatment  of 
''preferential  iriter-human  relations." 
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twe  may  think:  of  an  invest inat ion  as  dealing   witn 

or   entities,   wlucn   can   be  calleo  subjects  ano 


hach  cell  provides   a   olace   for   representing   a 

. between   one   subject   and  another,  ^ ne  object  ana 

another,  or  between  a  subject  and  dn  ooject. 

If  the  relation  describea  by  the  matrix  ..i  is 
symmetric  then  tne  matrix  will  be  symmetric,  i.e.,  the  encry 
in  cell  m(i,J)  will  be  equal  to  that  in  cell  m(J,i).  If  tne 
relation  is  not  symmetric,  then  following  Coombs'  (1964) 
usage  the  matrix  is  often  called  conditional,  for  the 
interpretation  of  the  difference  between  two  entries  in  a 
row  depends  on  the  particular  row  being  considered. 
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A  matrix  may  be  binary,  consisting  of  only  ones 
and  zeros,  or  it  may  contain  real  numbers  as  entries.  A 
binary  matrix  indicates  only  that  some  relation  either  does 
or  does  not  hold  between  two  entities,  while  a  real  matrix 
may  indicate  the  strength  or  type  ot  relation  between  them. 
.Vhere  repeated  or  multiple  measurements  are  made,  tne  data 
may  be  in  three-  or  higher-dimensional  form. 

These  properties  of  data  matrices  serve  to  link 
otherwise  disparate  fields  of  inquiry.  The  structure  or  a 
data  matrix  largely  defines  the  types  of  analysis 
appropriate  to  the  matrix,  ana  to  some  extent  the  sorts  of 
questions  it  can  answer.  Similar  data  matrices  tnus  serve 
to  unite  otherwise  dissimilar  fields.  (Coombs,  1964,  ana 
Shepard,  1972,  give  fuller  expositions  of  the  properties  or 
da td  mdtri  ces .  ) 

Sociometry  and  da^a  structure.  Sociometric  data 
naturally  falls  into  wnat  Coombs  (1964)  calls  an  intact 
matrix  form,  wherein  the  set  of  subjects  is  tne  same  as  the 
set  of  objects.  A  sociometric  data  matrix  may  be  binary  or 
roal,  but  it  is  almost  always  conditional  in  its  raw  rorm. 
These  characteristics  of  intactness  ana  condit ionali ty 
extend  the  definition  of  sociometric  data  and  relate  it  to 
other  studies  of  proximity,  especially  those  involving 
taxono.nic  analysis. 

So  far,  most  of  the  efforts  to  link  sociometry  to 
other  dreas  of  researcn  have  been  based  on  sharea  subjects 
of  investigation,  rather  than  on  similarity  or  researcn 
iiethoas.  Links  between  sociometry  ana  psychometr  ics ,  for 
instance,  hawe  come  about  as  investigators  i\ave  tried  to 
find  personal  correlates  of  sociometric  choice  status, 
i.  e.,  number  of  choices  received  from  others.  The 
'notivation  for  the  linkages  is  that  botn  techniques  are 
being  used  to  investigate  the  same  group  of  respondents. 
Thus,  sociometric  studies  have  been  combined  with  studies  of 
reputation,  acceptance,  status,  physical  environment,  etc. 
Cronbach  and  Gleser  (19b3)  review  methods  of  profile 
analysis,  another  way  of  viewing  the  sociometric  problem 
wnere  multiple-scale  instruments  are  used. 

Sociometry  can  be  categorized  in  three  ways.  To 
say  that  sociometry  is  the  study  of  preferential  internuman 
relations  indicates  the  content  of  many  sociometric  data 
sets,  but  not  the  aata  of  studies  or  functional  association, 
sucn  as  in  stuaies  of  invisible  colleges,  informal  working 
groups  of  scientists,  etc.  To  define  sociometry  in  terms  of 
conditional  intact  data  matrices  establishes  links  among 
sociometry  and  other  scientific  fields. 

There  are,  however,  several  studies  of  social 
structure  which  do  not  use  an  intact  matrix,  out  use  an 
off-diagonal  matrix,  in  which  groups  of  subjects  are   formed 
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on  the  basis  of  sharing  common  properties.  Phillips  and 
Conviser  (1973)  clustered  a  set  of  people  into  subgroups 
using  a  proximity  measure  based  on  their  shared  activities. 
A  reasonable  way  to  capture  this  aspect  of  sociometric 
investigation  is  to  say  that  soc iometricians  are  often 
interested  in  forming  clusters  of  people  on  the  basis  ot 
some  relation  among  them. 

Sociometric  Data  Collection  Methods 

In  the  years  since  Moreno  and  his  associates  first 
developed  the  original  sociometric  tecnnique,  many  new 
iietnoas  for  collecting  and  analyzing  sociometric  data  have 
)een  uevelooed.  Sociometric  questionnaires,  althouah  often 
concerned  with  interpersonal  preferences,  have  also  been 
used  to  investigate  working  habits,  sources  of  information, 
exclianje  of  help,  and  other  group  activities. 

Sociometric  questionnaires  usually  ask  for  direct 
choices,  for  rankings,  or  for  ratings  of  others  in  the 
jroup.  Each  type  of  questionnaire  can  be  further  classifieo 
into  tnose  with  a  limited  number  of  choices  and  those  whicri 
allow  an  unliiiiited  number  of  choices.  Respondents  may  be 
asked  co  list  the  names  of  people  they  woulo  like  to 
Hssocidte  with  or  to  share  some  activity  with,  or  may  oe 
presented  with  a  roster  of  group  members  ana  asked  to  rate 
or  rank  some  or  all  of  them  according  to  one  or  more 
criteria. 

Direct  c no i c e .  The  respondent-'s  original  task  was 
was  to  list  those  group  members  with  wnom  he  preferred  to 
engage  in  some  activity.  The  activity  was  generally  cnosen 
so  that  cnoice  of  partner  would  reflect  tne  chooser's 
overall  evaluation  of  the  chosen  person.  One  of  the  major 
variations  in  sociometric  technique  is  in  the  imposition  or 
a  limit  on  the  number  of  choices  the  responoent  may  make. 
Sometimes  the  limitation  of  choices  is  extreme,  as  where 
only  the  most-  and/or  least-preferred  associate  may  be 
listed  (rishbein,  196b).  More  commonly,  a  respondent  is 
asKed  to  rank  all  of  the  group  members  according  to  a 
criterion. 

There  has  been  much  debate  over  the  relative 
merits  or  limited  versus  unlimited  choices  and  of  forcea 
ranicings  as  investigative  techniques.  Eng  and  French  (1948) 
found  that  a  rank  order  derivea  from  paired  comparisons  of 
jroup  iiembers  correlated  more  closely  with  rankings  when 
there  were  unlimited  choices  rather  than  when  choices  were 
limitea.  Lindzey  and  Byrne  (1968),  in  their  extensive 
review  of  tiie  sociometric  literature,  concluaed  that 
unlimited  ciioices  reveal  more  information  about  the  status 
of  any  particular  member,  and  about  the  total  group 
structure  than  do  limited  choices.  However,  they  also 
concluue  the  exigencies  of  time,  statistical  analysis,  ana 
researcii  design  often  make  iL  oreferabl-e  to  use  a  limited 
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choice  technique,  Holland  and  Leinhart  (1973)  argue 
convincingly  that  limiting  the  number  of  choices  a 
responaent  is  allowed  to  make  distorts  the  group  structure 
as  revealed  by  the  data  beyond  the  ability  of  structural 
models  to  recover  it.  They  suggest  that  only  specializea 
unlimited-choice  procedures  will  recover  unbiasea 
sociometric  data. 

Despite  the  prevalence  of  ' limitea  cnoice 
instruments  in  past  research,  some  strong  arguments  can  be 
Tiacie  for  the  use  of  an  unlimited  number  of  choices  atia  or 
roster  instruments  which  present  each  respondent  with  a  list 
of  the  entire  group. 

Rating  scales.  One  of  the  major  reasons  for 
limiting  tne  number  of  choices  is  that  otherwise  tae  ranking 
task  becomes  unduly  difficult  and  time  consuming.  In  large 
jrouns,  especially  those  in  which  an  inoiviaual  may  have 
only  tne  most  limited  information  about  many  or  the  other 
members,  it  is  not  possible  for  a  member  to  malce  a 
meaningrul  rank  oraering  of  the  entire  group,  however, 
there  is  no  reason  why  so  difficult  a  task  should  be  imposea 
upon  the  respondent.  In  many  cases,  the  sorting  of  tne 
jrouo  into  a  limited  number  of  ordereo  classes  will  yield  a 
satisfactory  ouality  of  data,  while  at  tne  same  time  making 
ti^e  responaent's  task  easier. 

One  or  tne  most  common  ways  of  imposing  a  weak 
orderiiV}   on   a   group  is  the  use  of  rating  scales.   To  rate 

eacn  .nember  on  a  seven-point  scale  is  exactly  the  same  as  zo 
order  Che  group  into  seven  classes,  each  consisting  of  all 
iroup  members  who  receive  a  given  rating.  The  same  is  true 
for  such  sorting  techniques  as  the  Q-sort.  ,ihen  a  weak 
ordering  is  desired,  a  roster  instrument  nas  a  number  or 
advantdges  over  a  questionnaire  whicn  requires  tne 
respondent  to  write  in  a  stated  number  of  choices,  A  roster 
allov/s  each  respondent  to  be  presented  with  an  opportunity 
to  evaluate  every  otner  member  of  the  group.  If  tne 
respondent  is  free  to  make  as  many  choices  as  he  desires, 
eacn  entry  on  tl\e  roster  carries  information,  even  where  tne 
respondent  has  chosen  not  to  rate  a  particular  individual. 
It  is  safer  to  interpret  missing  data  in  a  roster  instrument 
as  representing  indiviauals  whom  tne  respondent  aia  not  know 
well  enough  to  rate,  especially  where  an  unlimited  number  of 
choices  is  involved.  This  may  be  esoecially  true  toward  the 
midale  of  a  rank  order  where  it  becomes  aifficult  to  assign 
discrete  ranks  to  those  who  are  simply  not  very  salient  on 
the  dii'iension  being  evaluated. 

A  roster  questionnaire  was  used  by  Roistacher 
(1971)  in  a  sociometric  stuay  of  high  school  boys. 
Roistacher  found  that  the  eignth  grade  boys  in  his  study 
responded  enthusiastically,  and  provided  data  of  hig^i 
quality.  Singleton  (personal  communication)  found  that 
third  grade  stuaents  were  able  to  read  the  names  on  a  roster 
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instrument  and  also  responded  well   to  a  request   to  rate 
classmates. 

Although  a  number  of  scaling  techniques  are 
beginning  to  be  used  in  sociometric  research,  by  far  tne 
'Tiost  common  method  for  generating  numbers  is  to  use  inaices 
derived  from  simple  ordinal  rating  scales.  bcales  on 
sociometric  instruments  have  had  from  two  to  as  many  as  one 
hundred  points. 

In  addition,  scores  for  sociometric  nominations 
have  been  constructed  from  sums  or  differences  between 
scales  from  the  same  instrument.  There  have  been  some 
attempts  to  use  versions  of  the  semantic  differential  (Uavis 
and  Warnath,  1957;  Lerner,  196b;  Fishbein,  1963,  1965),  but 
the  procedure  of  summing  ratings  on  the  various  scales  such 
as  good-bad  must  be  done  with  care,  since  it  is  easy  to 
obtain  "artifacts"  when  rankings  and  rating  scale  values  are 
treated  as  if  they  were  real  numbers. 

Other  methods  of  scaling  have  included  adjective 
check  lists,  (Davis  and  Warnath,  1957;  Lerner,  1965;  ana 
Mewcomb,  1961).  A  Q-Sort  was  employed  by  Peterson, 
Komorita,  and  Quay  (1964)  in  which  a  series  of  sociometric 
statements  were  placed  in  a  seven-point  forceo  normal 
distribution  in  terms  of  their  appropriateness  in  describing 
each  individual. 

There  have  been  few  successful  attempts  to  go 
beyond  the  use  of  rankings  and  simple  ordinal  scales  in  tne 
collection  of  sociometric  data.  Gardner  and  Thompson  (1956) 
constructed  a  set  of  scales  based  on  forced-choice 
comparisons  of  group  members  with  respect  to  their  capacity 
to  satisfy  certain  of  Murray's  needs.  Analysis  showed  that 
the  scales  so  produced  were  fairly  reliable  and  had 
approximately  equal  interval  scores.  Pepinski,  Siegel,  and 
Vanatta  (1952).  used  an  approximation  of  Thurstone-'s  method 
of  equal  intervals  to  measure  participation  in  a  group 
activity.  After  much  work  they  produced  a  group  of 
twenty-four  item,s  which  represented  eight  scale  positions 
witn  each  of  three  items. 

Finding  Sociometric  Cliques  and  Subgroups 

From  the  beginning,  sociometric  investigators  have 
been  interested  in  discovering  and  then  representing  the 
structure  of  a  group.  Group  structures  are  defined  in  terms 
of  individuals'  being  connected  by  stronger  or  weaker  links, 
or  being  in  some  sense  "closer  to"  or  "more  aistant  from" 
each  other.  A  major  research  task  has  been  to  divide  the 
population  into  cliques,  groups  of  individuals  who  have 
significantly  more  relation  to  each  other  than  they  do  to 
other  members  of  the  group. 
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The  original  relationship  was  individual  choice,  a 
clique  being  defined  as  a  group  of  individuals  each  of  whom 
prefers  the  company  of  at  least  one  other  member  of  the 
group  in  a  specified  activity  more  than  the  company  of 
anyone  outside  the  group,  Festinger,  Schachter,  and  Back 
(1950)  defined  a  clique  as  a  subsystem  of  three  or  more 
elements  in  mutual  interaction  with  each  other.  Luce  ana 
Perry  (1949)  formalized  the  graph  theoretical  notion  of  a 
clique  as  a  maximal  complete  subgraph,  a  set  of  completely 
linked  points  not  contained  in  a  larger  completely  linkea 
set.  Luce  (19b0)  generalized  his  definition  of  a  clique  to 
the  n-clique,  in  which  individuals  within  n  links  of  each 
other  were  treated  as  directly  connected,  ana  cliques  then 
extracted  from  this  n-graph. 

A  major  trend  in  sociometric  analysis  has  been  the 
generalization  of  group  membership  criteria  from  mutual 
choice  to  other  relationships.  In  general,  cliques  may  be 
defined  on  the  basis  of  any  relation  among,  or  attribute  of 
the  members  of  the  group  being  studied.  A  relation  is 
defined  between  two  members,  while  an  attribute  is  detinea 
on  a  single  individual. 

For  example,  a  group  of  scientists  could  be 
clustered  into  cliques  on  the  basis  of  how  frequently  they 
associate.  Alternatively,  they  might  be  clustered  according 
to  some  index  of  similarity  in  the  work  they  were  doing. 
Although  the  two  sets  of  cliques  might  have  quite  similar 
memberships,  the  former  set  of  cliques  is  aefined  on  the 
basis  of  a  bilateral  relation  while  the  latter  set  is  formed 
on  the  basis  of  a  shared  attribute.  The  former  example  is 
more  sociometric,  while  the  latter  example  is  traai tional ly 
a  problem  in  taxonomy, 

Most  clique  detection  methods  attempt  to  form  a 
partition  of  tne'  sociometric  group  i,  e,,  to  assign 
individuals  to  cliques  such  that  each  individual  is  a  member 
of  one  and  only  one  clique.  Partitions  are  seldom  obtained 
without  doing  some  violence  to  the  data,  since  a  group 
seldom  divides  into  completely  disjoint  subgroups,  Davis 
(1967)  found  that  groups  did  not  partition  into  two  disjoint 
cliques,  and  he  developed  structural  indicators  of  a  group's 
tendency  to  develop  cliques,  Davis  (personal  communication) 
later  suggested  that  partitioning  of  sociometric  groups  is  a 
false  goal,  since  partitionea  groups  are  as  rare  and  as 
suspicious  as  correlations  of  1,00,  A  real  partition  is 
usually  a  sign  of  conflict,  the  most  obvious  case  being  the 
partitioning  of  people  into  sides  in  a  war, 

Socioar ams .  The  original  method  of  analyzing 
sociometric  data  was  the  construction  of  a  sociogram,  a 
diagram  of  points  and  lines  representing  individuals  in  the 
sociometric  group  and  their  choices  of  other  group  members. 
The  sociogram  has  proved  to  be  a  compelling  way  of 
representing  group  structure,  especially  -  to  the  group-'s  own 
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members.  The  sociogram's  major  defect  is  that  its 
assumptions  are  not  explicit  and  that  it  seems  to  convey 
more  information  than  it  actually  carries.  It  is  used  as  a 
two-dimensional  picture  of  a  group's  structure.  The 
relative  position  of  points  represent  individuals' 
"positions"  with  respect  to  each  other  and  the  lines 
represent  their  choices  of  each  other.  Unfortunately,  there 
is  no  a  priori  reason  why  a  group's  structure  should  be 
representable  in  the  two-dimensional  Euclidian  space 
provided  by  a  sheet  of  paper.  In  general,  if  there  are  more 
than  three  people  in  the  group,  it  is  impossible  to 
represent  the  majority  of  the  mutual  relations  possible  to 
them  as  distances  in  two  dimensions.  Sociograms  suffer  from 
not  being  mathematically  "well  defined,"  i.  e. ,  there  is  no 
one-to-one  relation  between  a  set  of  sociometric  choices  and 
a  given  sociogram.  In  fact,  from  a  given  set  of  responses 
an  infinite  number  of  diagrams  may  be  constructed  which 
could  convey  widely  diverging  impressions  of  the  group's 
structure. 

A  number  of  workers  have  suggested  schemes  for  the 
construction  of  more  invariant  representations  of 
sociometric  structure.  Northway  (1940)  described  a  "target 
sociogram"  in  which  a  set  of  concentric  circles  is  used  as  a 
ground  to  indicate  the  choice  status  of  individuals. 
Overchosen  individuals  are  placed  toward  the  center  of  the 
target,  while  underchosen  or  low-rated  individuals  are 
distributed  around  its  periphery.  The  target  sociogram  is 
extensively  used  by  elementary  school  teachers,  who  seer.i 
quite  pleased  with  the  representation  of  choice  structures 
which  it  gives.  Borgatta  (1951)  suggested  that  sociograms 
be  drawn  so  as  to  minimize  the  number  of  crossed  lines. 
Techniques  such  as  these  may  prove  useful  when  adequate 
methods  have  been  developed  for  representing  group 
structures  in  graphic  form.  At  present,  there  seems  to  be 
more  payoff  in  developing  better  algorithms  for  representing 
social  structures  than  in  trying  to  draw  sociograms  by 
intuition.  Although  the  sociogram  is  universally  regarded 
as  the  best  final  representation  of  group  structure,  it  is  a 
poor  tool  for  determining  that  structure. 

Linkaoes  and  distances .  The  techniques  used  to 
determine  group  structures  from  sociometric  data  can  be 
divided  into  methods  of  linkage  analysis  and  methods  of 
distance  analysis.  The  model  underlying  distance  analyses 
is  that  the  set  of  people  can  be  regarded  as  a  set  of  points 
in  a  geometric  space,  some  of  which  are  "closer"  to  each 
other  than  others.  The  analytical  process  consists  of 
determining  the  dimensionality  of  the  space,  finding  (ana 
naming)  the  principal  dimensions,  and  determining  the 
configuration  of  the  points  in  the  space.  Tne  task  of 
finding  structures  and  subgroups  consists  of  constructing  a 
rule  for  determining  closeness,  and  determining  which  of  the 
points  (individuals)  are  mutually  close  enough  to  be 
regarded  as   cliques.    Factor  analysis,   multidimensional 
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scaling,  and  the  various  cluster  analysis  and  taxonomic 
methods  are  all  examples  of  distance  analyses. 

The  model  underlying  linkage  analyses  is  that  the 
sociometric  group  can  be  regarded  as  a  set  of  points,  some 
of  which  are  joined  by  lines  representing  a  relation.  The 
major  advantage  of  linkage  analysis  is  its  freedom  from 
assumptions  about  unknown  or  undefined  distances.  In  a 
distance  analysis,  there  is  a  distance  defined  between  any 
pair  of  points,  whether  that  distance  is  known  or  not.  In  a 
linkage  analysis,  points  either  are  or  are  not  joined  by  a 
line,  and  the  model  is  not  necessarily  incomplete  because 
some  points  are  not  connected.  In  a  distance  analysis,  two 
unrelated  points  may  be  described  as  infinitely  distant  from 
each  other,  but  often  this  is  "not  as  satisfactory"  as 
simply  being  able  to  say  that  the  points  have  no  relation 
defined  between  them.  Linkage  methods,  which  are  not  as 
common  or  as  numerous  as  distance  methods,  include  the 
graphic  analysis  of  sociograms,  the  statistical  analysis  or 
frequencies,  and  graph  theoretic  procedures. 

Linkage  Methods 

M dt r i X  permutation.  Since  the  work  of  Forsyth  and 
Katz  (1946),  almost  all  work  on  the  analysis  of  group 
structure  involves  the  matrix  representation  of  sociometric 
data.  Like  the  sociogram,  the  sociomatrix  is  not 
mathematically  "well  defined"  since  members  of  the  group  may 
be  arbitrarily  assigned  to  rows  and  columns,  however,  the 
number  of  matrix  representations  of  a  sociometric  data  set 
is  at  least  countable,  which  is  more  than  can  be  said  for 
the  number  of  possible  sociogram  representations  of  tne 
data.  The  earliest  work  on  sociomatr ices ,  by  Forsyth  ana 
Katz  (1946),  was  on  ways  of  rearranging  the  matrix  to  bring 
mutual   choices   closer  to  each  other  and  closer  to  the  main 

diagonal  of  the  matrix.  Beum  and  Brundage  ( 1 9b0 )  developed 
an  algorithm  for  bringing  the  matrix  to  this  canonicel  form 
and  Sorgatta  and  Stolz  (1963)  wrote  the  algorithm  into  a 
computer  program. 

Spilerman  (1966)  constructed  tne  most 
sophisticated'  of  the  matrix  permutation  techniques,  which 
eliminated  the  octopus-like  arms  produced  by  earlier  methods 
when  cliques  overlapped.  Spilerman  duplicated  individuals 
so  that  they  appear  in  simple  chains  with  everyone  to  whom 
they  are  linked.  The  duplication  makes  the  matrix  look  as 
if  it  is  composed  of  nonoverlapping  clloues.  Thus  all 
cliques  consist  of  adjacent  individuals,  so  that  the  area  of 
the  main  diagonal  is  tightly  packed.  Spilerman^s  strategy 
was  not  to  make  a  priori  definitions  of  cliques,  but  to 
rearrange  the  sociogram  so  as  to  allow  the  investigator  to 
make  better  a  posteriori  partitions  of  the  sociometric 
group.  Spilerman's  method  is  in  marked  contrast  to  the  more 
usual  strategy  of  attempting  to  determine  the  power  and 
generality  of  a  priori  rules  for  clique  detection. 
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Matrix  multiplication.    The   most  common  matrix 
operation   in  sociometric  analysis  is  matrix  multiplication. 
If  A  and  B  are  matrices,  and  C=AB,  then  c(i,j),  an  entry   in 
C,  is  defined  as 
c(i, j)=SUM(k=l ,n)  a(i,k)  X  b(k,j). 

Luce  and  Perry  (1949)  proved  several  theorems 
which  related  properties  of  matrix  powers  to  properties  or 
group  structures.  When  a  binary  sociomatrix,  C,  consisting 
only  ox  Is  and  Os,  is  multiplied  by  itself,  c(i,j)  contains 
the  number  of  sequences  of  length  2  between  members  i  and  j. 
Entries  in  the  diagonal  show  the  number  of  reciprocateu 
choices  made  by  a  member  of  the  group.  Similarly,  up  to 
k=3,  the  k-th  power  of  the  matrix  has  entries  containing 
sequences  of  length  k  between  members;  in  particular,  cells 
on  the  diagonal  contain  cycles  of  length  k,  i.e.,  sequences 
which  ooth  begin  and  end  with  the  same  person.  For  powers 
greater  than  three,  the  values  in  tiie  matrix  become 
periodic,  because  the  sequence  a,  b,  c  of  length  2  shows  up 
in  the  fourth  power  of  the  matrix  as  the  sequence  a,b,a,b,c 
of  length  4. 

These  clique  detection  methods  work  only  for 
nonoverlapping  cliques.  Luce  and  Perry  (1949),  and  iiarary 
and  Ross  (1957)  outlined  some  iterative  methods  for  finding 
multiple  cliques  in  sociomatr ices ,  but  these  methods  have 
been  superseded  by  computer  algorithms  (Augustson  ana 
Minker,  1970;  Purdy,  1973)  for  generating  all  of  tne  cliques 
in  a  graph,  even  where  cliques  overlap. 

Lingwood  (1969)  has  developed  a  technique  which  is 
in  the  spirit  of  matrix  multiplication  but  requires 
considerably  less  computation.  The  technique  consists  of 
beginning  with  a  single  member  of  the  group  and  tracing  all 
direct  connections  to  other  members.  A  "1"  is  then  placed 
in  the  vector  of  group  members  for  each  directly  connected 
member.  The  direct  connections  of  each  member  in  the  vector 
are  traced  and  a  "2"  is  entered  into  the  vector  for  each  of 
these  connections.  The  process  is  continued  until  all  those 
who  are  connected  to  the  starting  individual  have  been 
reached,  and  the  length  of  the  path  added  to  the  vector. 

The  most  elegant  matrix  multiplication  scheme  is 
Hubbell's  (1965),  which  uses  Leontief^s  (1941)  model  as  an 
analogue  for  communication  inputs  and  outputs  by  group 
members.  Leontief^'s  original  model  considers  N  factories, 
each  of  which  uses  the  products  of  itself  and  the  other  M-1 
factories  to  produce  a  product.  The  model  assumes  that  for 
any  given  level  of  inputs  a  factory  will  produce  a  given 
level  of  output,  and  the  N  factories  are  represented  as  a 
series  of  IJ  linear  equations.  Among  the  results  derivable 
from  the  model  is  that  given  a  set  of  initial  conditions, 
the  N  factories  will  reach  an  equilibrium  rate  of 
production,   in  which  the  total  amount  of -each  good  produced 
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will  equal  the  total  amount  of  that  good  consumed  by  the  n 
factories.  In  addition,  the  model  allows  the  effects  of 
changing  the  initial  or  boundary  conditions  to  be  predicted, 

riubbell  treated  the  giving  and  the  receipt  or 
sociometric  choices  as  analogous  to  the  inputs  and  outputs 
of  the  factories  in  Leontief's  model.  In  Hubbell^s  model, 
inputs  to  a  given  member  need  not,  as  in  the  Leontief  model, 
equal  the  output  from  that  member,  and  choices  may  have  botl^ 
positive  and  negative  weights.  Hubbell  stated  that  "where 
negative  lengths  are  present ... [c 3 1 ique-mates  can  ...  be 
allied  in  part  through  a  shared  antipathy  for  a  third 
party." 

Hubbell  assumes  that  status  is  the  equivalent  of 
an  output  in  the  Leontief  model.  Inputs  are  sociometric 
choices,  weighted  by  the  strength  of  the  choice  ana  tne 
status  of  the  chooser.  Inverting  the  choice  matrix,  tne 
equivalent  of  summing  all  powers  of  tl»e  matrix,  yielos  a 
matrix  whose  cells  are  contributions  of  member  J  to  tne 
status  of  member  i  and  whose  row  totals  are  statuses  for 
each  member  of  the  group.  The  inverse  of  the  matrix  is  more 
than  a  set  of  status  weights.  The  i,J-th  cell  of  the  matrix 
contains  the  effect  of  a  unit  change  in  the  status  of 
individual  j  on  the  status  of  individual  i  given  tne  present 
group  structure.  Thus,  the  inverse  of  the  matrix  can  be 
multiplied  by  a  vector  of  statuses  for  each  member  to 
investigate  the  dynamics  of  status  in  the  group. 
Partitioning  of  the  group  into  cliques  is  done  by  using  as 
the  measure  of  association  between  i  and  j  the  minimum  or 
the  contribution  of  one  to  the  status  or  the  otner,  i.e., 
!Tiin[a(i,j),  a(j,i)]. 

Hubbell  points  out  that  these  measures  or 
association  can  be  rank  ordered  and  a  cutting  point 
oarameter  chosen  at  a  level  which  will  eliminate  any 
specified  proportion  of  the  links  in  the  group.  Thus, 
cliques  of  varying  degrees  of  density  may  be  formed  and 
examined.  In  addition,  various  bounaary  conditions  may  be 
introduced  and  the  effect  of  boundary  condition  cnanges  on 
clique  structure  may  be  investigated. 

Mdt rices  .  atid  aiL^^Sj.  The  matrix  tecnniques 
discussed  above  represent  a  major  attemot  to  formalize  a 
oowerful,  but  largely  intuitive,  method  jor  describing 
social  structures.  The  classical  way  of  using  mathematics 
in  science  is  to  associate  things  in  tne  real  world  with 
mathematical  concepts,  to  operate  on  ti^e  concepts  using  tne 
content-free  rules  of  mathematics,  and  then  to  investigate 
the   relation  between   the   results   of   the   mathematical 

operations  .and  the  real  world.  In  the  case  of  sociometry, 
the  "things"  were  links  between  individuals,  and  trie 
mathematical  concept  was  the  matrix.  Tnere  lias  been  some 
degree  or  success  with  the  operation  of  matrix 
multiplication,   but   overall,  linkage  analyses  have  not  had 
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niMcn  relation  to  tne  booy  of   mathematics   in   which  ;natrix 
tneory  is  inberided. 

Graph  Theoretic  Methods*  Graph  theory  is  a  brancii 
of  'natheinat ics  concerned  with  abstract  conf inurations  oi 
points  and  lines,  narary,  Morman,  and  Certv/rigut  (196^) 
define  a  directea  graph  in  terms  or  four  primitives  anc 
four  axioms.   Tne  primitives  defining  a  graph  ares 


P\i    A  set  V  of  elements  called  "poinds.'' 

P2s  A  sec  X  of  elements  calleci'  "lines." 

t^3:  A  function  f  whose  domain  is  X  ano  Wiiosc  range 
is  contained  in  V. 

F4 :  A  function  s  whose  domain  is  X  and  wnosa  range 
is  containeo  in  V. 


The  axioms  defining  a  graph  ares 

AM  Tne  set  V  is  finite  and  not  eimpty. 

A2:  The  set  X  is  finite. 

A3!  I'o  two  distinct  lines  are  oarallel. 

A'l :  tii-^^re  are  no  loops. 

j\  grapn  looks  like  a  sociogram,  and  it  iias  '■)een 
suggested  tiiat  tneorems  derived  from  grapn  theory  iiiay  be 
useful  in  the  analysis  or  sociometric  oata.  bociograms 
which  differ  only  in  the  physical  placement  of  their  points 

regarded  as  different,  while  graph  theory 

any  representations  of  equal   numbers   of 

by   similar   a  rrangem.ents   of   lines   as 
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literature  of  operations  research,  where  it  is  used  in 
describing  flows  in  networks;  and  in  computer  science,  where 
it  is  used  in  describing  data  structures.)  Graph  theory  has 
developed  many  useful  definitions  of  configurations  oi 
social  structures.  Algorithms  have  been  developed  for 
locating  the  cliques  of  a  graph  (where  cliques  are  defineo 
in  several  alternative  ways),  and  for  finding  the  degree  of 
connectivity  or  fragmentation  of  given  parts  of  a  graph. 

Some  graph  theoretical  models  use  nu;nerical 
information  about  links  between  objects  to  generate 
networks,  which  are  graphs  whose  lines  .iiave  numerical 
values.  Networks  may  be  isomorphic  to  sociometric 
structures  (cf.  Ford  and  Fulkerson,  1962;  Hdrary,  i.'orman, 
and  Cartwright,  1965),  but  as  Gleason  (1969)  has  pointed 
out,  network  models  generally  require  that  the  quantity 
assigned  to  a  path  be  some  combination  of  the  quantity 
assigned  to  its  constituent  lines.  To  perform  mathematical 
operations  such  as  addition  on  these  numbers  often  requires 
stronger  metric  assumptions  than  can  safely  be  made  witn 
sociometric  data.  In  ract,  even  to  take  tiip  iiiaxi..iun  or 
minimum  value  of  a  set  of  lines  is  risky  where  it  requires 
comparison  or  ratings  across  subjects.  iJithout  additional 
Mieasur 3ment ,  it  is  impossible  to  say  that  a  likes  b  more 
tnan  c  likes  d  from  a  set  of  paired  comparison  data  or  ranK 
orders . 

The  two  major  applications  of  orapa  cheory  in 
sociometry  have  been  in  defining  cliques,  and  iii  definino 
relations  among  individuals  and  among  cliques  in  terms  or 
structural  balance.  The  first  graph  theoretic  definition  or 
a  sociometric  clique  was  suggested  by  Luce  and  Ferry  (1949), 
who  defined  a  clique  as  a  maximal  complete  subgraph,  whicl'i 
is  d  set  of  points  completely  connected  to  each  other  and 
iiot  part  of  a  larger,  completely  connected  set  of  points, 
fne  m.aximal  complete  subgraph  definition  is  rigorous  ano 
useful,  but  has  proved  too  strict  for  many  purooses.  Alba 
(1972)  used  Luce's  (1950)  invention  of  the  n-graph  to 
orovide  a  more  tractable  graph  theoretic  definition  for 
cliques.  An  n-clique,  according  to  Luce,  is  oi^e  in  which 
each  iiiember  is  connected  to  every  other  member  by  a  path  or 
n  or  rewer  lin-es.  Alba  introduced  the  additional  condition 
that  the  longest  .path,  the  clique-'s  diameter,  .iiust  be  or 
length  n,  for  otherwise,  members  of  an  n-clique  can  be 
connected  through  individuals  wno  are  not  clique  members. 

This  definition  is  more  inclusive  of  structures 
which  "look  like"  cliques,  but  do  not  meet  tne  strict 
requirements  of  Luce  and  Perry's  (1949)  original  definition. 
It  should  be  noted  that  none  of  this  ramily  of  definitions 
necessarily  partitions  a  group  into  cliques. 

i-iost  of  the  graph  theoretical  work  on  sociometric 
structures  has  involved  extensions  of  Heider''s  (1946) 
concept  of   structural   balance.    Balance-  theory   concerns 
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'sijnecj  graphs  representing  relations  between  people  an.,) 
objects.  The  lines  ot  the  orapn  can  take  on  the  Vdlnes  ''-••" 
an'i  .*-^!i  representing  positive  ana  neoative  relations 
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ix  Liiere  wer.'^  an  even  number  of  lines  witn  negative  si^^ns. 
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Davis  (1967)  pointed  out  that  a  balanced  graph  is 
an  ideal  type,  since  groups  consisting  of  only  two  opposed 
factions  are  seldom  found,  and  are  hardly  in  a  desirable 
state  of  affairs.  Davis  proved  a  set  of  theorems  giving 
conditions  for  the  partitioning  of  a  graph  into  two  or  more 
clusters.  A  cluster  is  a  set  of  points  connected  to  eacii 
other  by  positive  lines,  and  to  points  in  other  clusters 
only  by  negative  lines.  A  graph  which  can  be  split  inco 
clusters  is  known  as  clusterable,  and  the  partition  into 
clusters  is  known  as  a  clustering.  Davis^  clusterino 
theorems  are  based  on  the  condition  that  every 
cycle — undirected  path  beginning  and  ending  at  the  sa:.ie 
Doint — in  a  graph  must  contain  exactly  two  negative  lines  ir 
the  graph  is  clusterable.  If  the  graph  is  completo,  a 
unique  clustering  is  obtained;  otherwise  several 
clusterings  may  be  found.  Davis  gives  a  classification  oi 
tendencies,  which  may  have  analogues  in  social  relations, 
toward  cluster abil ity  in  incomplete  graphs. 

Cartwright  and  .  llarary  (1968)  related  tae 
partitioning  of  points  in  a  signed  graph  to  the  classical 
mathematical  problem  of  coloring  graphs.  They  defined  a 
graph  as  "colorable"  if  it  was  possible  to  partition  its 
joints  into  "color  sets"  in  a  fashion  siinilar  to  Jdvis' 
clustering.  Gleason  and  Cartwright  (1967)  aevelopeo  an 
alogrithm  which  determines  the  (possioly  unique) 
colorability  of  a  graph  from  its  adjacency  matrix. 

Peay  (1970)  nas  proposed  a  set  of   criteria   which 

would  allow  specification  of  degrees  of  colorability  an^ 
balance  in  grapns  rather  then  having  to  use  all-or-none 
criteria  which  could  be  violated  by  a  single  instance.  A 
triad  is  balanced  when  two  individuals  connected  by  a 
positive  path  (of  length  2)  are  not  connected  by  a  negative 
line.  Peay's  criterion  for  a  graph's  tendency  toward 
colorability  is  that  points  connected  by  positive  paths  or 
less  than  some  critical  length  not  be  connected  by  a 
negative  line.  A  colorable  graph  is  one  for  which  tne 
property  holds  for  a  path  of  any  lengtn.  Feay  devised  a  set 
of  tecnniques  for  determining  tendencies  tov/ard  colorability 
in  binary  and  scalar  matrices,  and  proposed  a  methoci  of 
statistical  evaluation  for  degrees  of  balance. 

The  graph  tneoretic  work  described  above  has  been 
concerned  with  structures,  such  as  cliques  and  clusters, 
which  extend  over  several  pairs  of  individuals. 

Even  though  the  definitions  of  cliques  and  balanced 
structures  have  been  extended  to  -include  n-cliques  and 
structures  with  tendencies  toward  balance,  the  probability 
of  colorability,  clusterability ,  or  balance  being  violated 
in  an  empirically  obtained  graph  is  considerable,  especially 
wnere  there  may  be  missing  data. 
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The  aifficulty  ot  lin'^inc^  extormer:  structures  ir, 
^'nplricci  lly  cieriveo'  graphs  [las  lea  to  an  riiteriiati  v-- 
aoproach,  which  is  to  view  a  oraph  as  composed  of  all  i cs 
nany  triads  oi  individuals,  ratlier  than  to  angreqdt.-^  i-iirs 
or  individuals  into  more  exten'Jep  structures.  A  cireci:-^j 
graph  of  iTiutuai  and  asymmetric  choices  can  be  co:npos8G  oi 
sixteen  types  of  triads.  wost  or  tiie  triads  are 
dist  invjuished  by  having  aifferent  combination.^  of  li;:3s 
connecting  their  members,  while  some  with  asymmetric  cnoices 
are  aisti  n<]uisiied  by  the  directions  or  the  asyrnaetr  ic 
choices,  iiolland  and  Leinhart  (1970)  reported  a  theore.n  ror 
computing  the  expected  numbers,  variances  and  covariancos  or 
the  sixteen  types  of  triads  in  a  graph  of  known  size,  giveii 
tne  observed  total  distribution  of  mutual,  asymmetric,  uu- 
null  choices  for  all  pairs.  This  distribution  provides  a 
means  of  statistically  evaluating  the  tendency  of  a  grapn  co 
deviate  rrom  ranuomness.  Holland  and  Leinhart  used  their 
rssults  to  evaluate  the  transitivity  of  relations  in  several 
graphs,  a  property  which  induces  oalanced  and  hi  erarcnicc-.l 
relations  as  special  cases. 

A  cnoice  relation  on  individuals  a,  b,  anJ  c,  is 
transitive  if,  wh.en  a  chooses  b,  and  b   chooses   c,   then   a 

chooses  c.  The  arithmetic  relations  denoted  by  "="  ana  "<" 
are  otner  examples  of  cransitive  relations.  beven  of  t.ie 
sixteei^  possible  triads  in  a  directed  grapn  are  intransitive 
while  the  other  nine  are  either  transitive  or  vacuously 
transitive.  A  triao  is  vacuously  transitive  wnen  it  does 
not  ruliill  the  "ir"  condition  tor  transitivity.  Iii 
particular,  a  triad  in  which  fewer  than  two  niembers  make 
choices  is  vacuously  transitive.  It  a  relation  or  no  cnoice 
between  a  pair  is  defined  as  equivalent  to  a  negative 
clioice,  tnen  a  colorable  graph  will  iiave  no  intransitive 
triads. 

There  are  logical  r  elat  ionsiiips  l^etween  the 
structure  of  a  graph  (macrostructure )  and  the  types  of 
triads  (microstructure)  which  will  be  found  in  it.  Flament 
(1963)  proved  that  a  balanced  grapn  will  have  only  balanced 
triads  in  it.  Other  models  of  social  structure,  such  as 
colorability ,  and  Davis  and  Leinhart's  (1970)  ranked 
clusters  model,  imply  that  particular  triads  will  be  absent 
from  a  graph.  In  a  perfect  case  of  conformity  to  a 
structural  type,  no  forbidden  triads  would  be  present.  The 
degree  to  which  the  prescribed  and  forbiodden  triads  are 
oresent  indicates  the  degree  to  which  the  oraph  conforms  to 
the  structural  model.  Davis  and  Leinhart  (1971)  constructed 
an  index  showing  the  extent  to  which  nonpermi ssable  triads 
were  present  in  a  graph,  but  had  no  way  of  testing  the 
statistical  significance  of  the  index.  The  theorem  of 
Holland  and  Leinhart  (1970)  allows  a  statistical  test  of  a 
graph's  conformity  to  a  structural  model, 

Davis  and  Leinhart  proposed  a  ranked  clusters 
model   of  social  structure,  in  which  cliqu-es  are  ranked  into 
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triad,  a,b,c,  the  fact  that  a  does  not  choose  b  shows  that  a 
and  b  are  members  of  different  cliques  at  the  same  status 
level.  The  same  is  true  of  b  and  c,  who  are  also  connected 
by  a  null  choice.  However,  the  choice  of  c  by  a  implies 
that  c  is  at  a  higher  status  level  than  a,  thus 
contradicting  the  effect  of  the  other  two  (null)  choices. 

Holland  and  Leinhart  (1970)  proposed  a  more 
general  model  of  transitivity  in  social  structure,  of  which 
the  Davis  and  Leinhart  model  is  a  special  case.  The  rankeo 
clusters  model  assumes  a  single  status  hierarchy.  rihere 
there  are  multiple  status  hierarchies,  the  triad  with  a 
single  asymmetric  choice  is  not  inconsistent.  A  common 
situation  in  which  there  are  multiple  status  hierarchies  is 
in  classrooms,  where  there  is  a  "sex  cleavage"  between  boys 
and  girls.  Holland  and  Leinhart  showed  that  tnere  was  a 
higher  than  expected  number  of  these  triads  in  sexually 
heterogeneous  groups,  and  a  lower  than  expected  number  in 
sexually  homogeneous  groups. 

Davis  (1970)  investigated  the  contribution  of 
families  of  triads  to  structural  balance  and  hierarchy  in 
graphs.  Davis  suggested  that  asymmetric  choices  migiit 
inaicate  relations  which  were  positive,  but  not  so  positive 
as  mutual  choices.  In  Davis  and  Leinhart's  work  on  the 
ranked  clusters  model,  certain  forbidden  triads  were  far 
more  prevalent  than  other  equally  forbidden  triads.  Davis 
investigated  the  effect  of  classifying  asymmetric  relations 
as  if  they  were  positive,  and  as  if  they  were  negative.  He 
found  that  the  classification  of  mutual  choices  as  positive 
and  mutual  null  choices  as  negative  was  consistent  with 
clusterability  in  all  groups.  In  small  groups,  a  better  fit 
to  the  clusterability  model  was  found  by  classifying 
asymmetric  choices  as  negative,  while  in  large  groups  a 
better  fit  was  found  by  classifying  them  as  positive.  Davis 
suggested  that  the  collection  of  data  in  a  small  group  was 
more  likely  to  be  one  of  unrestricted  choice,  while  in  a 
large  group,  restriction  of  the  number  of  choices  would  make 
a  null  choice  less  of  a  rejection. 

Davis  also  found  that  where  triads  consisted  of 
three  asymmetric  choices,  the  number  of  cyclic  triads  was 
less  than  woula  be  found  in  a  random  graph,  indicating  that 
v/here  there  is  a  hierarchy  of  choice,  choices  tend  to  be 
arranged  in  transitive  patterns  up  the  hierarchy. 
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Li nk  ag e  Frequency  iViethocis.  Some  of  the  earliest 
work  on  the  partitioning  of  sociometric  data  has  been  with 
relative  frequencies  of  choice  within  and  among  potential 
cliques.  An  excellent  introduction  to  the  statistical 
analysis  of  linkage  patterns  may  be  found  in  Moreno  et  al.-'s 
Sociornetrv  Re  ader .  After  a  false  start,  in  which  trie 
probability  of  one  person's  choosing  another  was  viewed  as 
an  independent  binomial  event,  more  realistic 
characterizations  of  tne  statistical  nature  of  the  socionrar! 
were  developed. 


Katz  and  Powell  (I960)  established . a  bai 
statistical  investigation  of  directed  graphs  by  ( 
the   sample  space  of  a  directed  graph.   They  stati 


Katz  and  Powell  (I960)  established .  a  basis  for  t.ie 

elaborating 
;ed  that  tne 
sample  space  of  a  graph  is  not  based  on  the  presence  or 
absence  of  a  single  line  betv;een  a  given  pair  of  points,  but 
is  rather  composed  of  sets  of  graphs  with  the  same  total 
number  of  points  and  lines,  or  of  graphs  with  the  same 
number  of  lines  arriving  at  and/or  leaving  a  given  point. 
This  work  makes  it  possible  to  evaluate  such  things  as  tne 
Jegree  to  which  a  group  contains  more  or  fewer  isolates  than 
would  oe  expected  by  chance. 

Most  of  the  statistical  work  on  grapns  nas 
involved  comparison  with  a  totally  random  graph,  a  somewnat 
rare  creature  in  real  life,  riowever,  it  seems  likely  tnat 
scatistical  metnods  worked  out  with  ranoom  comparisons  can 
also  oe  usea  to  evaluate  differences  between  nonrandom 
graphs,  once  good  graph  metrics  are  workea  out. 

Proctor  (196/)  gives  a  formula  ana  procedure  for 
estimating  the  variance  of  the  linkage  density  in  a  sample 
of  group  members.  Criswell  (1949;  cited  in  Proctor  ana 
Loomis,  1951)  constructed  a  chi-square  test  tor  determining 
the  relative  degree  of  ingroup  choice  of  a  subset  of  a 
sociometric  group. 

The  distributional  characteristics  or  sociometric 
networks  were  investigated  in  a  series  of  three  papers  oy 
three  overlapping  sets  of  workers  at  the  University  of 
'.iichiQan''s  Mental  dealth  Research  Institute.  The  data  for 
all  three  papers  is  a  set  of  up  to  eight  ordered  sociometric 
choices  made  by  each  of  9b6  stuaents  in  a  junior  l^igii 
school.  Rapoport  and  riorvath  (1961)  derived  a  distribution 
function  for  the  number  of  votes  an  individual  received  as 
n-th  (//here  n  ranged  from  one  to  eight)  friehd.  A  good  fit 
to  the  distribution  of  the  number  of  choices  as  n-th  frieno 
A/as  given  by  a  negative  binomial  distribution  function  whicn 
the  authors  derived  in  two  ways.  If  frienaship  choices  are 
considered  independent  events,  then  the  negative  binomial 
distribution  is  derivable  from  a  distribution  of  Greenwooa 
anu  Yule  (1920)  which  compounds  a  Poisson  distribution  for 
the  nu.aber  of  choices  received,  and  a  Pearson  Type  III 
distribution  for  the  expectea  number  of  ciioices  for  an 
individual.     riowever,    the   same    negative    binomial 
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distribution  function  is  derivable  from  a  Polya  process,  a 
time  dependent  model  of  contagion,  which  assumes  that  the 
probability  of  an  event  occurring  in  a  small  present 
interval  of  time  increases  linearly  with  the  number  or 
events  which  have  occurred  up  to  now,  but  decreases  overall 
with  tiie  passage  of  time.  Thus,  the  same  distribution  or 
votes  to  an  individual  occurs  when  votes  are  seen  as 
independent  events  or  as  stochastically  dependent.  Rapoport 
and  Horvath  showed  that  each  person  had  a  cnaracteristic 
"popularity  intensity"  which  influenced  tne  number  of  votes 
he  received  as  a  friend  at  any  level  of  choice.  There  v/as 
no  such  thing  as  a  person  who  was  characteristically  a 
"second  best,"  or  "n-th  best"  friend,  ratner  than  a 
generally  popular  or  unpopular  person, 

»Jork  begun  by  Rapoport  and  Horvath  on  tracing  tne 
sociometric  net  was  continued  by  Foster,  Kapoport,  ana 
Jr^vant  (1963).  The  tracing  process  consisted  or  randomly 
selecting  a  small  set  of  "starters,"  deciding  on  tne  number 
of  choices  to  be  traced  from  each  person,  and  adding  the 
names  of  previously  unchosen  persons  to  the  set  as  they  were 
reached  by  successive  generations  of  choices.  The  authors 
developed  a  recursion  equation  v/hich  gave  tiie  cumulative 
number  of  people  who  should  be  chosen  if  ciioices  were  made 
at  random.  The  authors  made  several  empirical  tracings  of 
their  lar-je  sociogram,  which  tl->ey  compared  with  the  data 
derived  from  a  hypoth-esis  of  a  random  network. 

The  empirical  tracings  showed  a  significantly 
S'nallor  rate  of  addition  of  new  people  than  dia  the  "random, 
net"  tracing.  Foster,  Rapoport,  and  Orv/ant  siiowed  tnat  the 
bias  in  the  empirical  tracings  could  be  assigned  to  two 
sources,  which  tiiey  called  a  parent  and  a  sibling  bias.  A 
narent  of  individual  A  was  defined  as  someone  wno  chose  A. 
Tv/o  individuals  were  defined  as  siblings  if  they  had  a 
common  parent.  The  bias  in  the  tracings  through  tiie  network 
were  found  to  be  attributable  to  tendencies  to  choose 
parents  and  siblings  at  a  higher  than  chance  rate.  Since 
parents  and  siblings  are  already  members  of  the  tracing, 
choices  to  them  do  not  result  in  the  aadition  or  new  members 
to  the  tracing.  The  authors  computed  the  parent  and  sibling 
bias  probabilities  in  their  empirical  data  and  developed  a 
new  recursion  equation  which  included  the  two  prooabili ties 
as  parameters.  A  good  fit  to  the  data  was  obtained  without 
having  to  include  a  parameter  ror  bias  resulting  from 
unreciprocated  choices  of  generally  popular  individuals. 

Foster  and  horvatii  (1971)  investigated  the 
distribution  of  reciprocated  choices  ano  suogested  tne 
probability  of  a  reciprocated  choice  as  a  measure  of  social 
distance.  They  developed  a  mooel  whicri  treated  the 
reciprocation  of  a  choice  as  a  sequential  process  in  which 
the  best  friend  is  chosen  first,  then  the  second  best,  etc. 
The  authors  developed  a  linear  equation  for  the  probability 
of  a  mutual  cnoice  between  an  i-th  and  a  j-th   best   friend. 
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Tae  probability  of  a  choice-^s  being  reciprocated  decreases 
as  eacii  person  gets  farther  away  from  his  choice  of  best 
friend.  The  two  empirically  derived  constants  in  the  linear 
equation  were  related  to  the  asymptotic  probability  of  being 
cnosen  at  all,  and  to  the  width  of  individuals-'  circles  or 
friendship. 

These  contributions  link  the  study  of  sociometric 
networks  to  the  study  of  generalized  stochastic  networks  and 
graphs,  work  which  is  also  pursued  in  physics,  communication 
theory,  and  information  theory.  The  treatment  of 
sociometric  choicees  through  a  network  as  a  stocnastic 
process  links  the  study  of  sociograms  to  the  stuoy  of 
solf-propagating  processes,  such  as  rumors  and  epidemics. 

Lingwood  (1969)  used  a  chi-square  statistic  to  test  the 
equivalence  between  cliques  determined  on  the  basis  or 
infornation  gatnering,  and  cliques  determined  on  the  basis 
of  research  methodology,  in  a  popolation  of  communication 
researchers.  Lingwood^s  hypothesis  was-  that  tiie  frequency 
of  choice  witnin  and  between  cliques  determined  by  one 
metliod  would  predict  the  relative  frequency  of  choice  within 
ano  bet'/ sen  cliques  determined  by  the  other.  Lingwood 
concluded  that  the  chi-square  [nethod  was  successful  once  tae 
cliques  had  been  discovered  by  other  methods.  Until  tne 
advene  of  computers,  it  was  far  easier  to  test  tne 
sigruficance  of  a  partition  into  cliques  than  it  was  to  make 
the  partition.  Trial-and-error  was  used  to  split  the  group 
so  as  to  maxiiiiize  the  ratio  of  within-clique  to 
between-clique  linkages.  There  now  exists  a  whole  array  of 
techniques  for  "hunting"  through  a  set  of  data  and 
subsetting  it  in  a  way  whicii  maximizes  some  measure  or 
optimality.   (E.  g.,  Sonquist,  1970) 

A'ork  on  the  distributional  characteristics  of 
networks  and  graphs  is  leading  to  tiie  development  of 
measures  of  difference  between  sociometric  structures  and  of 
measures  of  the  conformity  of  a  graph  to  a  structural  model. 
Proctor  (1960)  discusses  the  probabilistics  of  finding  a 
partition  for  .a  group  which  best  fits  the  group-'s  set  of 
choices.  Lioorman  and  Olivier  (1973)  derived  several  metrics 
on  a  space  of-  finite  trees,  which  may  be  used  for  such 
things  as  determining  the  relative  degree  of  similarity 
a^nong  a  sez  of  hierarchical  social  structures.  The  tree 
metrics  are  closely  related  to  partition  metrics,  which 
yield  f^easures  of  similarity  between  partitions  of  a  set. 

Distance  //lethods 

Distance  methods  of  analysis  are  characterized  by 
the  assumption  tnat  the  sociometric  group  can  be  represented 
as  a  set  of  points  distributed  through  a  geometric  space. 
The  idea  of  relations  among  individuals  and  groups  being 
expressed  in  terms  of  social  distance  is  a  plausible  one. 
In   .aduition,   there   is   a   large   and   well-known  body   or 
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i-natiiematics  dealing  with  points  and  spaces,  which  finds  wiae 
use  in  the  sciences.  The  main  philosophical  problem  in 
distance  analyses  concerns  the  validity  or  the  spatial 
metaphor  for  human  relations,  rafJner  than  the  validity  or 
mathematical  operations  once  the  spacial  analogy  has  been 
made.  Almost  all  statistical  methods  are  examples  oi 
distance  analyses. 

Some  properties  of  spaces.  A  space  in  its  most 
basic  rorm  is  a  set  of  objects  called  points,  and  is  defineu 
by  specirying  the  points  in  it.  For  instance,  if  we  arc 
observing  the  height,  weight,  and  sex  of  a  group  or 
subjects,  our  observations  define  a  three-dimerisional  space. 
iiacn  point  in  this  observation  space  is  a  triple  (n,w,s). 
Among  the  properties  of  this  observation  space  is  that  it 
:'ias  no  negative  regions,  since  no  one  has  a  negative  height, 
.'/eight,  or  sex.  Other  properties  are  that  two  of  its 
dimensions  are  expressible  as  positive  real  nu.nbers,  while 
the  taird  dimension  is  expressible  as  two  integers,  e.  g.,  0 
ana  I  . 

The  most  imoortant  forms  of  spaces  for  measurement 

purposes   are  metric  spaces,  on  which  a  metric,  a  measure  oi 

'listance,  has  been  defined.  A  metric  must  satisfy  three 
oroperti  es : 

it  must  be  eitner  0,  or  greater  tnan  0,  ana 
if  the  distance  between  two  points  is  0  then  they  are 
tne  same; 

a  metric  must  be  symmetric,  i.  e.,  the 
distance  from  A  to  B  must  be  the  same  as  the  distance 
from  3  to  A; 

a  metric  must  satisfy  tiie  triangle 
inequality,  i.  e.,  the  distance  from  A  to  B  plus  the 
distance  from  B  to  C  must  be  greater  than  or  equal  to 
tiie  distance  from  A  to  C. 

Any  function  of  two  points  in  the  space  satisfying 
tnese  chr'^e  criteria  will  serve  as  a  metric   on   tne   s.^-ace. 

ietric  spaces -are  important  because  they  are  civilized.  Mot 
only  GO  they  benave.  in  ways  similar  to  our  perceptual  space, 
but  they  are  mathematically  tractable.  Locations  ana 
distances  in  metric  spaces  tena  to  behave  as  we  expect;  a 
large  set  of  measurement  techniques  are  available  for 
application  to  a  set  or  points  in  a  metric  space.  Most 
statiscical  techniques  are  valid  only  for  data  which  ciefine 
a  i.ietric  space.   One  or  ti^e  most  important  tasks  in  dealing 

/itn  nonmetric  data  is  to  find  some  way  of  converting  the 
dita  to  .netric  form. 

Cons  ioeration  of   soine   of   the   partitioning   anc 

index  construction  methocs  reveals  that  mai~iy  measures  or 
association  will  not  satisry  all  of  the  ••requirements   of   a 
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metric •  In  most  instances,  the  raw  data  is  in  the  for'n  of 
rankings  or  other  ordinal  ,  measures.  In  the  majority  of 
sociometric  data  c(i,j)  and  c(j,i)  of  a  choice  matrix,  C, 
are  not  equal,  and  thus  violate  the  symmetry  property  of  a 
metric.  In  some  cases  a  function  of  the  two  cells  is  usee 
to  replace  the  values,  thus  symmetrizing  the  matrix. 

The  Euclidean  space  is  one  whose  metric  is  tne 
familiar  hypotenuse  rule  of  plane  geometry.  rjhere  a  and  b 
are  ooints  the  -distance  between  thein,  d(a,b),  is: 
d(a,b)  =  (a2+b2)V2, 

The  i£uclidean  metric  underlies  most  commonly  usee 
statistical  methods,  but  workers  in  the  areas  of  scaling  ano 
measurement  (Coombs,  1964?  Torgerson,  1953)  have  used  metric 
spaces  of  a  variety  of  types.  One  commonly  useo 
generalization  of  the  Euclidian  metric  is  ti\e  generalizeo 
.linkowski    metric    in   which   d(a,b)  =  (a''^+b"'^)  '^,  Tne 

properties  of  the  Minkowski  metric  can  be  alterea  by 
adjusting  the  value  of  r.  i(hen  r  is  set  equal  to  2,  then 
the  Minkowski  metric  is  a  Euclidian  metric.  i^hen  r  is  set 
to  1,  then  the  Minkowski  metric  becomes  a  "city  bloci-c" 
metric,  in  which  points  equally  distant  from  an  origin  form 
not  a  circle,  but  a  square  stanaing  on  one  corner. 

Many  techniques  for  partitioi^ing  sociometric 
iroups  into  cliques'  assume  that  group  members  may  be 
represented  as  poii^ts  oistributed  in  a  space  or  known  type 
and  metric,  but  of  unknown  dimensionality.  Individuals  are 
points,  and  the  entries  in  the  sociomatrix  are  perceiveo 
proximities  of  the  individual  points  to  each  other.  The 
oroblem  is  to  determine  the  configuration  of  the  individual 
ooints  in  the  space  and  to  decide  which  inaividuals  are 
sufficiently  close  to  each  other  to  be  regarded  as  a  clique. 

Until  recently,  factor  analysis  was  the  most 
commonly  used  distance  analysis  technique.  Factor  analysis 
has  been  augmented  by  nometric  techniques  such  as 
multidimensional  scaling  and  cluster  analysis,  which  use 
rank  orders  rather  than  raw  scores  as  input.  A  promising 
area  for  finding  sociometric  techniques  is  numerical 
taxonomy,  a  field  concerneo  with  generating  structures  from 
matrices  of  pairwise  association. 

Eactor  dn  al V 5  i 5 .  The  original  use  of  factor 
analysis  was  by  Bock  and  Husain  (1950),  who  converted  tne 
original   choice  matrix   into  a  matrix  of  correlations  ano 

oerformed  a  centroid  factor  analysis.  The  rationale  behino 
the  use  of  factor  analysis  is  that  a  clique  may  be  viewed  as 
a  set  of  individuals  who  tend  to  make  Similar  choices,  or, 
conversely,  that  a  clique  is  a  set  of  individuals  v/ho  tend 
to  be  chosen  by  similar  sets  of  others.  nhen  a  sociomatrix 
is  factor  analyzed,  each  clique  is  represented  as  a  factor, 
and  tne  dimensionality  of  the  space  is  equal  to  the  number 
of  cliques.    An   individual's   membership-  in   cliques   may 
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thererore   be   expressed   as   a  set  of  factor  loadings  whicii 
define  the  strength  of  his  association  with  each  clique. 

As  computing  techniques  evolved,  the  more  powerful 
varimax  factor  analysis  was  used  by  MacRae  (1960)  ana  by 
.Jriglit  and  !.:vitl:s  (1961)  in  the  analysis  of  choice  rTiacric3s. 
The  results  of  these  analyses  ars  soinewiiat  mixedi  as  there 
are  separate  sots  of  chooser  and  cnosen  cliques  which  must 
be  reconciled.  beaton  (1966)  extended  /,'ac.Rae's  '■lociel  by 
treating  the  chooser  ana  chosen  structures  as  two  batteries 
of  variables  und  by  performing  an  interbattery  factor 
analysis.  ilosanchuk  (1963)  compared  direct  factor  analysis 
;7itii  several  matrix  manipulations  and  sociogram  ceciinigues 
for  partitioning  sociometric  data  sets.  ne  found  that 
factor  analysis  proauced  a  better  recovery  of  o 
or  econstructe.i  sociogram  than  did  any  of  tiie  otlior 
tecnniques.  Tnere  have  not  been  any  published  renorL'" 
comparing  the  nonmetric  techniques  to  factor  analysis  in 
r^covnring  preconstructed  sociograms.  nowever,  nonmetric 
teciini  jues  have  proved  successful  in  reconstructing  otiier 
sorts  of  configurations  of  points,  especially  where  tao 
origindl  conf  icjurat  ion  was  subjected  to  a  nonlin-vjr 
transf  or-aat  ion  b<n'ore  being  recovered. 

r-ionmetr  i  c  sc  aljinq.  One  metlioci  of  analysis  wnicn 
overco.nes  many  objections  to  the  metric  assumptions  oi 
factor  analysis  is  noh.,ietric  multidimensional  scaling.  Tiiis 
C'^chnique ,  lilin  factor  analysis,  requires  as  innuc  a  matrix 
of  proxii:iities ,.  e,  n.,  a  correlation  matrix.  In  nonmetric 
scaling,  only  che  roiz-wise  rank  orders  of  the  values  in  tae 
■aatrix  are  used  in  the  analysis.  An  iterative 
tr ial-and-error  procedure  is  used  to  "Jiggle"  a  set  of 
ooints  in  a  'luclidean  space  to  satisfy  the  ranic  orders  oi 
oroximities.  The  result  of  this  procedure  is  that  a  set  oi 
■:)oliit5  in  a  space  of  unknown  metric  and  dimension  is  given 
an  analogue  of  points  in  a  -fuclidean  space  of  given 
"iii  lensionali  ty .  Any  or  a  number  of  oowerful  clustering 
teciniiues  may  tnen  be  applied.  (An  introduction  to  tne 
field  of  nonmetric  scaling  may  be  found  in  Jreen  ana 
Carmone,  1970,  A  somewhat  more  advanced  treatment  may  be 
found  in  Sl^eparo,  womney,  and  IJerlove,  1972.  Loth  of  these 
.7orks  contain  ■  extremely  complete  bil::^liographies  .  ) 

'Jleason  (1969)  used  iionmetric  multidimensional 
scaling  to  evaluate  aata  obtainea  by  Nev/comb  (1961). 
Jleason  converted  the  preference  data  of  a  group  of  college 
men  into  a  geoinetric  representation  with  a  nonmetric  scaling 
program.  The  points  tnus  obtained  were  then  subjectea  to  a 
.tuclidean  space  hierarchical  cluster  analysis  to  determine 
cliques.  The  problem  of  satisfying  the  symmetry  requirement 
for  a  metric  was  solved,  as  in  factor  analytic  methods,  oy 
representing  inoividuals  as  separate  chooser  and  chosen 
points,  since  the  choices  were  not  reciprocal.  Ueason 
sM-gestei  that  an  analysis  of  the  difference  between  an 
i.i-ividual''s  location  as   chooser   and   chosen  might   prove 
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useful,   but   at   present  this  disparity  has  not  yet  been 
related  to  other  characteristics  of  individuals  and  groups. 

Multiple  discr imin  ant  analysis.  Another  technique 
which  makes  strong  assumptions  on  the  space  is  the  use  of 
multiple  measurements  and  multiple  discriminant  analysis. 
Riffenburgh  (1966)  suggested  that  if  k  measurements  are 
made  on  the  individuals  in  each  of  p  groups,  each  individual 
may  be  regarded  as  a  point  in  a  space  of  k' dimensions .  The 
centroid  of  each  group  in  the  space  may  be  computed  ana 
distances  between  groups  determined.  By  making  some 
reasonable  assumptions  about  the  space,  confidence  intervals 
for  the  intergroup  distances  can  be  computed, 

Hiffenburgh  computed  a  set  of  sociometric 
distances  between  five  racial  groups  in  Hawaii  with  a  high 
degree  of  success.  He  pointed  out  that,  in  general,  p-i- 1 
sociograms  can  be  drawn  for  a  set  of  p  groups.  Each  group 
will  provide  a  complete  set  of  distances  for  the  sociogram 
as  well  as  contributing  to  an  overall  set  of  distances. 
Ri ff enburgh's  approach  takes  its  strength  primarily  from  its 
reliance  on  multiple  measurements,  which  allow  the  stronger 
metric  assumptions  which  justify  multiple  discriminant 
analysis;  his  procedure  seems  well-suited  to  situations  in 
which  multiple  measurements  can  be  taken  and  nas  one  of  tne 
most  elegant  measurement  bases  of  any  sociometric  method. 

Cluster  analysis.  To  the  extent  that  the  problem 
of  sociometric  analysis  is  seen  as  one  of  locating  and 
clustering  points  in  a  metric  space,  sociometry  becomes  a 
specialized  application  of  numerical  taxonomy.  Lorr  (1968), 
in  a  review  of  .the  literature  of  classirication  ana 
typological  procedures,  describes  four  major  types  of 
clustering  methods.  (Most  of  the  references  in  this  section 
are  from  Lorr's  1968  paper,  which  is  not  generally 
available. ) 

n'hat  Lorr  calls  "complete  linkage  analysis"  or 
"successive  cluster-  build-up"  uses  a  similarity  cutting 
point  criterion..  The  criterion  is  used  here  to  specify 
that  each  member  of  the  clique  is  closer  to  every  other 
member  than  he  is  to  anyone  outside  of  it.  Alternatively, 
the  centroid  of  the  clique  may  be  used  as  an  origin  for  the 
cutting  point.  The  criterion  for  single  linkage  analysis  is 
a  rigorous  expression  of  the  original  definition  of  a  clique 
as  a  group  of  people  who  preferred  or  were  closer  to  each 
other  than  to  others.  Single  linkage  analysis  thus  seems 
appropriate  as  a  way  or  partitioning  a  group  into  disjoint 
cliques. 

Single   linkage  analysis  uses  an  arbitrary  cutting 

point   of  similarity  to'  define  clique  membership.    Any 

members   of  the  data  set   whose  distance  is  less  than  the 

cutting  point  value  are  regarded  as   being   in  the   same 

clique.   Thus,  every  member  of  the  clique  -is  more  similar  to 
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at  least  one  other  member  of  that  clique  than  he  is  to 
anyone  outside  of  it.  Single  linkage  analysis  is  useful 
for  tracing  chains  of  individuals  and  produces  extended, 
serpentine  clusters.  Single  linkage  analysis  seems 
appropriate  for  analyzing  networks  which  connect  cliques 
once  some  idea  of  the  clique  structure  has  been  obtained  by 
a  complete  linkage  analysis,  (Fuller  treatments  of  single 
linkage  analysis  may  be  found  in  McQuitty,  1957;  Sokal  and 
Sneath,  1963;  Needham,  1961;  Parber-Rhodes  1961;  Cattell 
and  Coulter,  1966.  ) 

Successive  cluster  build-up  occurs  when  the 
cutting  point  is  allowed  to  increase  and  new  members  are 
added  to  the  clique  in  what  might  be  thought  of  as 
successive  layers.  (See  McQuitty,  1963;  Michener  and  Sokal, 
1957;  Sawrey,  Keller,  and  Conger,  1960;  Saunders  and 
Schucman,  1962;  Lorr  and  Radhakr ishnan,  1967;  Bonner,  1964.) 
Many  of  the  Euclidean  space  hierarchical  clustering  programs 
are  of  this  sort.  They  begin  with  a  matrix  of  Euclidean 
distances  between  points,  and  construct  centroids  by  taking 
root-mean-square  distances  between  individuals  already 
included  in  a  cluster. 

A  third  approach  is  to  partition  the  space  into 
subspaces  containing  subsets  of  points  which  are  as  tightly 
packed  as  possible.  The  algorithm  to  achieve  compactness 
attempts  to  minimize  the  mean  within-cluster  variance  of  the 
Doints,  shifting  points  between  clusters  to  do  so.  (See 
Thorndike,  1953;  Edwards  and  Cavalli-Sforza,  1965;   Ball  and 

liall,  1965;  McQuitty  and  Clark,  1968.) 

Cormack  (1971)  gives  a  very  6omplete  and  concise 
review  of  classification  methods,  including  consideration  of 
the  clustering  techniques  discussed  by  Lorr  as  part  of  a 
more  comprehensive  and  technical  discussion.  The  reader  is 
referred  to  Cormack'^s  extensive  bibliography  for  further 
reading  on  the  subject. 

Norman  (1967)  introduced  a  different  approach  to 
the  minimum-variance  clustering  oroblem,  using  a  technique 
which  partitions  the  entire  set  of  individuals  into  sets 
which  minimize  within-cluster  variance  while  maximizing 
between-cluster  variance.  The  partitioning  process  is 
iterated  to  yield  a  hierarchical  set  of  increasingly 
smaller  clusters.  Norman's  procedure  .  is  especially 
interesting  because  it  permutes  the  matrix  of  distances  in 
order  to  operate  on  submatrices,  and  thus  combines  aspects 
of  both  graphic  matrix  manipulation  and  computational 
methods  in  the  partitioning  of  data  sets. 
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.'Networks  and  covers* 

A  partition  of  a  set  requires  that  each  member  of 
tne  set  be  located  in  one  and  only  one  subset.  For  the 
sociometric  investigator  the  partition  has  been  a  useful 
goal  because  it  seemed  the  only  way  to  prevent  cliques  from 
blending  with  each  other  into  a  hopelessly  confused  mass. 
Spilerman  (1966)  used  a  matrix  manipulation  procedure  in 
which  individuals  were  duplicated  in  order  to  show  their 
simultaneous  membership  in  a  number  df  cliques.  Spilerman-'s 
procedure  illustrates  the  use  of  covers  ratner  than 
partitions  of  sociometric  sets.  A  cover  of  a  set  S  is 
defined  as  a  family  of  subsets  such  that  every  member  of  '6 
is  a  member  of  at  least  one  subset.  The  conditions  for 
covering  a  set  are  considerably  less  rigorous  than  those 
required  for  a  partition,  which  is  a  family  of  subsets  of  5 
such  tnat  each  member  of  S  is  a  member  of  one  and  only  one 
subset.  The  difficulties  of  using  a  cover  show  up  v/hen  a 
linkage  frequency  analysis  is  used.  In  general,  linkage 
frequency  methods  require  transitivity  of  the  linkage  in 
order  to  form  cliques.  If  C(l)  and  C(2)  are  two  cliques, 
ai^ci  i  is  linked  to  members  of  each  of  them  strongly  enough 
to  be  a  member  of  each,  then  transitivity  will  usually 
contraaict  the  assertion  that  C( 1 )  and  C(2)  are  disjoint. 

Guimearaes  (1968)  suggested  that  qualitative 
networks  could  provide  a  means  of  representing  sociometric 
structures.  A  network  is  usually  defined  as  a  graph  whose 
lines  have  numerical  Values,  which  may  represent  sucn 
variables  as  frequency,  channel  capacity,  or  intensity  ot 
linkage.  rlowever,  a  network  might  be  defined  as  a  graph 
witn  functional  as  well  as  structural  information.  A 
sociometric  group  could  be  connected  by  a  variety  or 
networks,  i.e.,  an  information  network,  a  friendship 
network,  etc.  It  might  be  oossible  to  use  a  clique 
detection  method  to  partition  a  single  network  in  a  very 
strong  fashion.  The  union  of  a  set  of  such  partitioned 
networks  would  form  a  cover,  since  networks  of  different 
kinds  would  overlap,  ■  even  where  cliques  of  a  single  network 
were  constructed,  so  as  not  to  overlap.  It  seems  appropriate 
to  use  logical  combinations  of  multiple  measures  before 
using  arithmetic  combinations,  since  logical  combinations 
make  far  weaker  metric  assumptions  of  the  data.  (For 
reviews  of  network  theory  see  Van  Valkenburg,'  1964.) 

Abstract  Algebras.  Boyle  (1969)'  used  concepts 
froiii  modern  algebra  to  investigate  clique  structures  without 
reference  to  metric  assumptions  about  the  data.;^"^  He  defines 
sociometric  choices  as  mappings  from  the  chooser  to  tne 
chosen  which  form  a  transitive  relation  on  the  set  of 
individuals.  By  making  some  assumptions  about  choices  in 
hierarchies,  e.g.,  that  choices  will  be  made  only  laterally 
or  to  an  indiviaual  one  level  higher,  he  was  able  to  make 
inrerences  about  the  structures  of  two  hierarchical 
organizations. 
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Construction  of  Sociometric  Indices 

After  analyzing  cliques  and  clusters,  a  seconu 
analytical  task  for  the  sociometric  investigator  is  to 
construct  scores  for  individuals  or  groups  of  individuals, 
(^e  may  think  of  scores  as  being  of  two  general  types*  a  set, 
I,  of  individual  scores,  and  a  set  G  of  group  scores.  The 
mappings  comprising  these  scores  may  be  specified  as 
J:  5  X  PS— >  R 
G:  PS  X  PS— >  R 

where  S  is  the  set  of  individuals  in  the  sociometric  group, 
and  PS  is  the  power  set  of  S,  the  set  of  all  subsets  of  S. 
The  individual  indices  are  functions  of  indiviouals  anc  oi 
subsets  of  the  sociometric  group.  Individual  scores  rerer 
explicitly  to  a  single  individual,  but  also  rerer  implicitly 
to  some  set  of  otlier  people  to  whom  tlie  individual  is 
related.  y\n  example  would  be  the  nuiaber  of  choices  aii 
individual  receives  from  the  group  as  a  v/hole,  or  from  so:ne 
subset  of  the  group.  Group  indices  are  runctions  of  a 
single  subset  of  the  group  and  of  one  or  more  other  subsets. 
An  example  would  be  the  proportion  of  group  Hicmbers  in  a 
given  group,  or  the  number  of  otiier  cliques  in  direct 
contact  with  a  given  group.  The  construction  or  an  index  is 
dependent  on  specifying  not  only  the  individual  for  whom  the 
index  is  to  be  constructed,  but  also  the  set  or  individuals 
in  which  he  will  be  embedded.  (Proctor  (1967)  gives  an 
examole  of  some  techniques  required  for  hypotnesis  testinfj 
when  data  are  derived  from  pairs  of  individuals  as  well  as 
from  single  individuals.) 


The  most  obvious  index  to  compute  for  an 
individual  is  his  row  or  column  total  in  the  sociomatrix, 
the  number  of  choices  he  gives  and  receives.  Several 
investigators  (e.g.,  Jennings,  1950;  Dunnington,  1957),  have 
separated  groups  into  relatively  highly  cnosen  versus 
relatively  unchosen  individuals,  using  a  variety  or  criteria 
to  construct  a  cutting  point.  Early  work  on  the 
construction  of  group  indices  led  to  the  construction  of 
sucii  indices  as  ,.ioreno^s  "einotional  exosnsiveness , "  tne 
ratio  of  total  ciioices  made  to  tne  total  number  of  ciioices 
possible  in  the  group.  Proctor  and  Loomis  (1951)  used  as  an 
index  of  qroup  cohesiveness  the  number  of  uiutual  cnoices  in 
the  group  divided  by  the  maximum  number  of  such  choices  in 
the  group.  The  same  authors  provide  a  large  variety  ot 
other  arithmetic  indices.  Many  similar  arithmetic  indices 
dave  been  constructed  by  a  number  of  v/orkers,  e.g.,  i^ornan 
(19i;3),  i'.lcKi  nney  (1948),  Smucker  (1949),  and"  Beauchamp 
(1965). 

Later  work  has  taken  into  account  not  merely  the 
direct  choices  between  individuals,  but  also  the  indirect 
linkages.  Katz  (1953)  used  the  su:n  of  successive  powers  oi 
the,  matrix,   together   with   an  attenuation  factor  for  eacn 
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power,  which  made  an  individual  status  a  weighted  sum  or 
choices  made  at  various  distances.  Taylor  (1969)  modified 
i<atz's  index  by  normalizing  it  according  to  the  number  or 
choices  that  the  person  gave,  as  well  as  received.  The  most 
elegant  form  of  this  type  of  status  index  is  in  Hubbell^'s 
(1965)  model  which  considered  choices  given,  choices 
received,  and  exogenous  variables. 

Lin  (1970)  has  presented  a  number  of  individual 
indices  which  are  easily  computed  but  which  make  strong 
assumptions  about  the  data's  level  of  measurement.  Lin 
begins  by  establishing  the  "influence  domain"  of  an 
individual,  defined  as  the  total  number  of  persons  who 
directly  or  indirectly  chose  that  person.  The  individual's 
centrality  is  defined  as  the  mean  distance  between  him  ana 
all  individuals  in  his  influence  'domain,  thus  treating 
linkages  as  equal  intervals.  Finally,  Lin  defines  t.ie 
nrestige  of  an  individual  ass 
P(i)=I(i)/(C(i)*(n-l )) 

if  C(i)^0;  P(i)=0  otherwise.  I(i)  is  the  size  of  i's 
influence  domain  and  C(i)  is  i's  centrality  index.  Tne 
Drestige  of  person  i  is  thus  defined  as  a  function  of  the 
number  of  persons  in  the  group,  the  number  of • persons  either 
directly  or  indirectly  choosing  i,  and  the  average  distance 
of  the  choosing  chains.  The  prestige  index  varies  from  0  to 
1,  equaling  0  v/hen  no  other  person  chooses  i,  ana  I  Yihe.n 
every  other  person  chooses  i  directly. 

Indices  are  an  important'  product  of  sociometric 
research,  especially  where  they  relate  to  optiinality  of 
structure  or  functioning  in  a  social  organization. 
Jnfortunat ely ,  most  indices  are  dd  hoc .  with  no  accompanying 
proofs  of  their  properties.  Since  few  socio:netric 
investigators  have  the  inclination  for  "riiroducing 
mathematical  proofs,  and  since  few  mathematicians  are 
acquainted  with  problems  of  data  collection,  it  appears  that 
better  indices,  must  arise  from  better  interdisciplinary 
partnerships. 


Computers  and  Sociometry 

The  computer  provides  a  inedium  for  organizing  and 
storing  sociometric  data,  for  editing  and  correcting 
sociometric  data  sets,  and  for  performing  sociometric 
analyses.  As  larger  sociometric  data  sets  ere  collected  and 
libraries  of  sociometric  data  are  assembled,  the  need  for 
explicit  standards  and  procedures  for  managing  sociometric 
data  iias  grown.  Small  data  sets  may  easily  be  kept  on 
punched  cards  and  managed  by  hand,  but  large  matrices, 
especially  ones  involving  multiple  measures  are  best  kept  on 
tape  or  disk  files.  A  problem  still  to  be  resolved  is  tne 
adoption  of  a  standard  format  for  sociometric  files,  since 
it  is  often  necessary  to  go  through  extensive  reformatting 
of  sociometric  data  files  in  the  course  of  using  analysis 
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programs  written  by  different  autnors.  bimildr  problems  or 
file  standardization  are  presently  being  discussed  by  thn 
developers  of  statistical  systems. 

Data  editing  and  form  dtting.  ::ost  socioMotric 
data  sots  begin  as  scalar  matrices  whica  are  transcribcsr 
from  their  original  format  to  punciied  cards.  One  of  tiio 
most  common  reasons  for  limiting  the  size  of  a  sociomatrix 
CO  less  than  eighty  is  so  that  an  inoiviaual's  choices  can 
be  stored  on  a  single  card. 

Such  matrices  can  be  manipulateci  on  the  coinputar 
througn  ti^e  use  of  standard  mathematical  pac/.ages  ano 
subsystems  which  allow  a  data  matrix  to  be  react  in, 
■symmetrized,  augmented,  totaled,  etc.  These  systems  have  no 
"sociometr ic"  options  as  such,  but  proviae  tools  which  tne 
user  iiiay  use  to  v/rite  his  ov/n  socioiietric  procedures. 
i^robably  the  most  common  of  such  systems  is  CA3IC,  uii 
inceractive  language  for  mathematical  calculation  founu  in 
the  libraries  of  most  timesharing  computer  systems. 
APL\360,  distributed  by  IBW  (1970)  has  a  more  powerful  set 
of  matrix  manipulation  procedures  than  does  BASIC,  but 
requires  a  special  keyboard  and  type  face  on  the  computer 
terminal.  Several  variants  of  APL  have  been  written  for 
both  liSii  and  other  computers,  which  do  not  require  a  special 
keyboard.  Speakeasy,  a  widely  available  system  developed  by 
Cohen  and  Vincent  (1968),  has  many  of  the  features  of  APL, 
but  is  somewhat  easier  for  the  new  user  to  learn  and  uses  a 
standard  keyboard.  Interactive  systems  such  as  these  are 
useful  for  performing  relatively  simple  analyses  of  small 
sociometric  matrices,  and  for  editing  and  fciiecking  small 
matrices.  Analyses  which  'require  operations  more 
complicated  than  tlie  computation  and  statistical  testing  or 
marginal  frequencies  require  specially  written  programs, 
most  of  which  are  interactive. 

Li n k  aci e  an  -alvsis  oroqr  dins.  The  most  complete 
sociometric  program  available  for  distribution,  anu  the  only 
one  approximating  a  sociometric  utility  system,  is  Albans 
(1972)  SOCK,  a  set  of  routines  for  subsetting  and  clustering 
sociomatrices .  The  system  reads  in  a  set  of  sociometric 
choices  and  eoits  them  into  a  matrix  in  standard  form, 
assigning  new  identification  numbers  to  individuals.  Tne 
Miatrix  is  then  decomposed  into  disconnected  groups,  each  of 
which  can  be  analyzed  separately.  SOCK  contains  routines 
for  generating  several  measures  of  proximity  and  pairv/ise 
association.    The   user   may   then  use   Johnson-'s    (1967) 

a 
;.iDSCAL 

ty 
matrices . 


hierarchical   clustering   algorithm  for  either  a  single  or 
complete  linkage  analysis.   Kruskal-'s  (1964a,  1964b)   i.lDSCj 
is  incorporated  into  SOCK  for  distance  analyses  of  proxini' 


COMPLT,  described  in  Alba  (1972),  is  a  program  for 
forming  the  adjacency  matrix  of  a  graph  and  extracting 
cliques  and  n-cliques  from  the  matrix.    »fhere   the   overlap 
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between  a  pair  of  extracted  cliques  is  greater  tnan  a 
specified  threshold,  the  program  combines  them  into  a  single 
clique.  For  each  subgraph  (clique)  thus  constructed,  the 
ratio  of  actual  lines  present  to  the  total  number  of  lines 
possible  in  the  clique  is  computed.  The  ratio  of  the  nun'f^er 
of  lines  joining  points  in  the  clique  to  lines  leddinc 
outside  of  the  subgraph  is  also  computed.  Both  ratios  are 
tested  against  a  random  graph  using  a  hypergeometr ic  test 
statistic  described  in  Alba  (1972). 

COMPLT  is  one  of  several  programs  which  extract 
maximal  complete  subgraphs  by  using  an  algoritnm  of 
Bierstone's  programmed  by  (Augustson  and'  Minker,  1970). 
iiowever.  Mulligan  and  Cornell  (1972)  showed  that  Augustson 
and  Minker^s  algorithm  was  wrong.  Purdy  (1973)  developed  an 
algorithm  for  finding  the  cliques  of  a  graph  which  runs 
twice  as  fast  as  any  previous  implementation  of  the 
Sierstone  algorithm,  and  presented  a  formal  proof  of  his 
algorithm's  correctness.  Any  programs  using  the  Bierstone 
algorithm  should  therefore  be  modified  to  use  the  improved 
Purdy  algorithm. 

DIP  (Directed  Graph  Processor)  oeveloped  oy 
oleason  (1971)  is  a  system  for  manipulating  graphs  in  matrix 
form.  DIP  performs  all  of  the  usual  arithmetic  and  logical 
operations  on  matrices  which  are  found  in  other  packages, 
i.  e.,  it  adds  and  multiplies  matrices,  and  extracts  ana 
augments  rows  and  columns.  In  addition,  DIP  computes 
c(istanc9  and  connectivity  matrices  for  graphs.  DIP  is 
related  to  SOCK  and  CO^.iPLT  in  that  it  is  a  system  for 
processing  graphs  in  matrix  form,  but  it  is  more  general  in 
its  application  than  either  of  the  other  two  programs.  In 
SDCK,  the  user  may  invoke  specific  programs,  such  as  the 
routine  for  generating  proximities,  ana  may  specify  program 
options,  such  as  v;hich  proximity  measure  is  to  be  generated. 
DIP  is  a  language  rather  than  a  set  of  programs.  The  user 
writes  his  own  program  for  an  analysis  using  the  DIP 
language.  DIP  is  superior  to  other  computing  languages 
because  graph  operations  are  built  into  it.  For  instance, 
the  user  may  obtain  the  distance  matrix  of  a  nraoh.  A, 
merely  by  writing,  "COMPUTE  DI5TA:JCE5  FOi^  A,"  rather  than  by 
specifying  the  long  sequences  of  arithmetic  operations 
involved. 

SOCPAC-I ,  developed  by  Leinhart  (1971)  decomposes 
a  directed  graph  into  its  constituent  triads"  and  tests  the 
frequencies  of  various  types  of  triads  against  a  random 
graph  with  the  same  number  of  null,  asymmetric , and  mutual 
links.  SOCPAC-I  tests  the  statistical  significance  of  triaa 
distrioutions  by  using  the  distribution  theorem  described  in 
iiollana  and  Leinhart  (1971). 

Dist  anc  e  analysis  proqr  jns.  kihile  linkage 
analysis  of  social  structures  requires  the  use  of  specially 
wri.tten   programs,   distance   analyses   can  often  be  aiaoe  by 
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using  sociomatrices  as  input  to  general  purpose  distance 
analysis  routines.  SOCK  uses  Kruskal^s  iMDSCAL-IV,  a  widely 
used  program,  for  performing  distance  analyses  on  matrices 
edited  by  other  routines.  A  review  of  factor  analysis, 
multidimensional  scaling,  and  taxonomic  and  clustering 
programs  is  beyond  the  scope  of  this  paper,  both  because 
there  are  so  many  programs  available,  and  because  the  rate 
of  development  Information  on  new  programs  for  sociometric 
analysis  are  published  in  the  program  abstracts  section  -or 
:3ehcjvior  dl  Science,  and  in  the  Newsletter  of  SIGSOC,  the 
opecidl  Interest  Group  on  Social  Science  Computin'j  of  the 
Association  for  Computing  Machinery. 

Inaex  construction  and  statistical  analysis. 
Once  tne  cliques  and  nets  in  a  sociomatrix  nave  been 
determined,  it  is  often  desirable  to  analyze  characteristics 
of  cliques,  either  at  the  individual  or  the  group  level. 
Such  cifidlyses  are  oest  performea  using  standara  statistical 
systems  such  as  OSIRIS  ( Interunivers ity  Consortium  for 
.-'olitical  Research,  1971),  SPSS  (ilie,  Bent,  ana  hull,  1971), 
or  DATA-TEXT  (Armor  and  Couch,  1971),  All  of  these  systems 
use  data  in  the  form  of  a  rectangular  matrix  each  row  of 
•/nich  contains  the  observations  for  one  case.  Each  colui.m 
of  tne  Matrix  represents  observations  of  one  variable  over 
all  cases. 

Data  obtained  from  spocialized  sociometric 
analysis  programs  can  be  added-  to  stanaard  statistical  files 
of  data  on  individuals  in  the  group.  For  instance,  a 
specialized  sociometric  system  could  be  used  to  obtain  a 
list  of  cliques  and  their  members.  The  cliques  can  be 
assigned  identification  numbers  and  each  individual  in  tiie 
group  tagged  '.vith  the  identification  number  of  each  clique 
of  whicn  he  or  she  is  a  member.  Tiie  clique  identification 
numbers  can  tiien  be  used  as  control  variables  in  standara 
statistical  analyses,  such  as  testing  the  hypothesis  that 
tne  mean  socioeconomic  status  'of  individuals  varies 
significantly  between  two  cliques. 

These  statistical  systems  all  have  programs  for 
aogregating  individual  records  into  records  at  the  clique 
level,  allowing  for.  the  testing  -of  nyootheses  concerning 
clique  indices,  such  as  size  or  connectivity.  These 
statistical  systems  can  also  accept  sociomatrices  in  either 
triangular  (symmetric)  or  square  form  for  input  to  their 
distance  analysis  programs. 
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Some  Unsolved  Problems 

There  remain  in  sociometric  analysis  several 
methodological  and  philosophical  problems  for  which  there 
are  at  present  no  satisfactory  solutions,  Sociometry  began 
as  a  geometric  problem  with  the  creation  or  a  sociogram,  out 
one  whose  basic  mathematical  attributes  were  not  in  douL^t. 
The  sociogram  is  a  geometrical  construction  in  a  two 
dimensional  Euclidean  space,  i.e.,  physical'  lines  drawn  on  a 
physical  sheet  of  paper.  The  matrix  representation  at  tirst 
seemed  to  overcome  the  geometrical  problems  in  reprosantirig 
sociometric  data.  It  becomes  more  and  more  clear  that  tne 
majority  of  successful  representations  of  sociometric  data 
require  the  assumption  that  a  sociometric  oata  set  consists 
of  points  distributed  throughout  some  kind  of  space.  -lOsc 
of  the  powerful  statistical  operations  sucn  as  factor 
analysis  and  multiple  discriminant  analysis  require  a 
Euclidean  metric  space,  a  very  difficult  commoaity  to  obtain 
in  most  psychological  measurement.  Sociometric  analysis  has 
inade  a  great  deal  of  progress  since  Moreno  and  see, as  to  oe 
maintaining  a  unity  of  ends  reached  through  a  variety  of 
means.  There  is  strong  agreement  that  the  graphic 
representation  of  sociometric  information  is  the  most 
powerful  and  the  most  easily  understooo,  but  at  present  tac 
matrix  representation  of  sociometric  data  provides  the  only 
viable  instrument  for  objective  analysis.  Progress  is  being 
made  both  on  the  problems  of  partitioning  ana  covering  a 
sociometric  •  data  set  and  in  the  construction  of  individual 
and  and  group  indices.  At  present  the  field  seems  to  be 
divided  between  those  who  assume  that  almost  any  ar itiiiaet ic 
operation  may  be  performed  on  sociometric  data  witii 
impunity,  and  those  who  will  make  no  assumption  beyond  those 
allowed  by  a  weak  order  relation  on  the  data.  The  forner 
workers  are  probably  making  mistakes  in  their  construction 
of  indices,  but  it  is  also  likely  that  the  "weak  order" 
people  are  not  making  best  use  of  the  available  data. 

Another  problem  is  that  of  interpersonal 
comparison  of  utilities.  If  a  and  b  are  two  objects,  and  i 
and  j  two  persons,  we  are  able  to  tell  if  i  prefers  a  to  b, 
but  we  are  unable  to  tell  whether  i  likes  a  more  than  j 
likes  b.  Such  interpersonal  comparisons  underlie  many  of 
the  arithmetic  procedures  used  to  partition  groups.  It  is 
possible  that  a  solution  to  this  problem  might  be 
approximated,  but  the  experience  in  price  theory  does  not 
indicate  that  we  should  be  optimistic. 

The  combination  of  sophistication  in  mathematical 
theory  with  increasingly  skillful  use  of  computing  power 
should  provide  a  viable  mixture  of  analytic  subtlety  ana 
brute  force  for  the  analysis  of  group  ana  organizaional 
structure.  In  addition,  'the  perception  of  sociometry  as 
closely  allied  to  taxonomic  and  classification  theory  shoulo 
result  in  the  finding  of  new  allies  in  other  disciplines. 
It  would  be  a  shame  to  have   psychologists-,   biologists,   and 
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mathematicians  laboriously  duplicating  each  others'  work  ana 

thereby   missing   the    chance   for  faster  progress   ana 
intellectual  partnership. 
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